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Abstract. We use Series' Markovian coding for words in Fuchsian groups and the 
Bowen-Series coding of limit sets to prove an ergodic theorem for Cesaro averages of 
I spherical averages in a Fuchsian group. 
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1. Introduction 

1.1. An ergodic theorem for surface groups. Let g > 2 and let T be the fundamental 
group of a surface of genus g endowed with a set of generators 

Q 

satisfying the standard commutator relation 

[\[ai,bi] = 1. 

^ For g G T, let \g\ stand for the length of the shortest word in the generators ([!]) 

\Q representing g. Let 

£J S(n) = {g G T : \g\ = n} 

be the sphere of radius n in T, and let K n be the cardinality of S{n). A special case of 
the main result of this note is the following pointwise ergodic theorem for T: 

Theorem A. Let T act ergodically on a probability space (X, u) by measure-preserving 
transformations, and, for g 6 F, let T g be the corresponding transformation. Then for any 
^ (f G L\{X,u) we have 

d N-i f 

n=0 n g eS(n) J x 

both v -almost surely and in Li(X, u) as N — )• oo. 

Our main result, Theorem [Bj is that a similar ergodic theorem applies much more 
generally to any finitely generated Fuchsian group with a suitable choice of generating set, 
see Section [2.5| for a precise statement. For numerous examples of actions to which this 
theorem applies, see [HI [13]. Theorem [B] is derived from a pointwise ergodic theorem for 
free semigroups from [6] whose formulation we shortly recall. For previous literature on 
pointwise ergodic theorems for actions of various classes of non-amenable discrete groups, 
see for example O O [El QUI EED El E] . 
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2 ALEXANDER I. BUFETOV AND CAROLINE SERIES 

1.2. Ergodic theorems for free semigroups. Let (X, v) be a probability space and let 
T\, . . . , T m : X — > X be z/-preserving transformations. 

Denote by W m the set of all finite words in the symbols 1, . . . , m: 

W m = {w = wi . . . w n , Wi £ {1, . . . , m}}. 

The length of a word w is denoted by \w\. For each w G W m , w = w\ . . .w n , define the 
transformation T w by the formula 

T w = Tw\ ■ ■ ■ T Wn . 

Now let A be an m x m-matrix with non-negative entries. For each w G W m , w = 
w\ . . . w n , set 

A(w) = A WlW2 . . . A Wn _ lWn . 

Now let ip be a measurable function on X and for each n = 0, 1, ... , consider the 
expression 

( 3 ) s nV = 



T.\ w \=n A (w) 

where we assume that the denominator does not vanish. 



Furthermore, consider Cesaro averages of the "sphere averages" and set: 

N-l 

N 



1 N ^ 

(4) 4(<p) = -^2s^). 

n=0 

Definition 1. A matrix A with non-negative entries is called irreducible if for some n > 
all entries of the matrix A + A 2 + • • • + A n are positive. 

Definition 2. A matrix A with non-negative entries is called strictly irreducible if A is 
irreducible and AA T is irreducible (here A T stands for the transpose of A). 

A measurable subset Y C X will be called Ti, . . . ,T m - invariant if its characteristic 
function \Y satisfies T\xy = ••• = T m XY = Xy- Denote by B the c-algebra of all 
Ti, . . . , T m - invariant subsets of X. Given ip £ L\(X, u), denote by K(ip\B) the conditional 
expectation of ip with respect to B. 

We now recall Corollary 2 in [6]: 

Proposit ion 1.1. Let A be a strictly irreducible mxm matrix. LetT\, . . . ,T m be measure- 
preserving self-maps of a probability space (X,u). Let B be the a-algebra of T\, . . . ,T m - 
invariant measurable sets. Then for each tp £ L\(X, v) we have cfj(T)(p — > K(tp\B) almost 
everywhere and in L\(X, v) as N — > oo. 



Theorems |A| and [B] will be derived from Proposition 1.1 using the Markov coding for 
Fuchsian groups introduced in 0] , see also [TBI EH] • 
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2. Markov Maps for Fuchsian Groups 

Let r be a finitely generated non-elementary Fuchsian group acting in the hyperbolic 
disk D. The Markov coding, originally introduced in [3] to encode limit points of T as 
infinite words in a fixed set of generators, also gives a canonical shortest form for words in 
r. The coding is defined relative to a fixed choice of fundamental domain TZ for T, which 
we suppose is a finite sided convex polygon with vertices contained in D U <9D, such that 
the interior angle at each vertex is strictly less than n. By a side of TZ we mean the closure 
in D of the geodesic arc joining a pair of adjacent vertices. We allow the infinite area case 
in which some adjacent vertices on <9B are joined by an arc contained in <9D; we do not 
count these arcs as sides of TZ. We assume that the sides of 1Z are paired; that is, for each 
side s of 1Z there is a (unique) element e £ T such that e(s) is also a side of 7Z and such 
that 1Z and e(7Z) are adjacent along e(s). (Notice that this includes the possibility that 
e(s) = s, in which case e is elliptic of order 2 and the side s contains the fixed point of 
e in its interior. The condition that the vertex angle is strictly less than tt excludes the 
possibility that the fixed point of e is counted as a vertex of TZ.) 

We denote by dTZ the union of the sides of TZ, in other words, dTZ is the part of the 
boundary of TZ inside the disk D. Each side of dTZ is assigned two labels, one interior to TZ 
and one exterior, in such a way that the interior and exterior labels are mutually inverse 
elements of T. We label the side s C dR interior to TZ by e if e carries s to another side 
e(s) of TZ, while we label the same side exterior to TZ by e _1 , see Figure [l] With this 
convention, TZ and e _1 (7?.) are adjacent along the side whose interior label is e, while the 
side e(s) has interior label e _1 . 

Let Tq be the set of labels of sides of TZ. The labelling extends to a T-invariant 
labelling of all sides of the tessellation T of D by images of TZ. The conventions have been 
chosen in such a way that if two regions gTZ, hTZ are adjacent along a common side s, then 
h~ l g 6 Tq and the label on s interior to gTZ is h~ 1 g, while that on the side interior to 
hTZ is g~ 1 h. Suppose that O is a fixed basepoint in TZ and that 7 is an oriented path in 
D from O to gO, g 6 T, which avoids all vertices of T , passing through in order adjacent 
regions TZ = goTZ, giTZ, . . . , g n TZ = gTZ. Then the labels of the sides crossed by 7, read in 
such a way that if 7 crosses from gi~\TZ into g(TZ we read off the label e« = g~\gi of the 
common side interior to giTZ, are in order e\, e2, • • • , e n so that g = e\e<i ■ ■ ■ e n . This proves 
the well known fact that Tq generates T, see for example [1]. In particular, if we read 
off labels round a small loop round vertex v of TZ, we obtain the vertex cycle at v with 
corresponding vertex relation e\ti . . .e n = id. The generating sets implicit in Theorem [B] 
are obtained in this way. 

Following [3], the fundamental domain TZ is said to have even corners if for each side 
■s of TZ, the complete geodesic in D which extends s is contained in the sides of T ■ This 
condition is satisfied for example, by the regular 4g-gon of interior angle ir/2g whose sides 
can be paired with the generating set of Theorem [A] to form a surface of genus g. If the 
path 7 from O to gO described above is a hyperbolic geodesic and TZ has even corners, 
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Figure 1 . Labelling the sides of the fundamental domain 1Z. The label e 
appears interior to TZ on the side of TZ adjacent to the region e~ l TZ. 

then the corresponding representation of g by the word e\e2 ■ ■ ■ e n is shortest in (T, To), 

see [am]. 

Let \&TZ\ denote the number of sides of 1Z. In [I] we showed that, subject to certain 
restrictions if \&1Z\ < 4, one can associate to any fundamental domain with even corners an 
alphabet A and a transition matrix P, so that A is mapped by a finite-to-one map it onto 
To, in such a way that the obvious extension of ir to a map from the set of finite sequences 
with alphabet A and allowed transitions defined by P to the group V is surjective. We call 
this map n, the alphabet map. A crucial feature of the alphabet map is that every word 



in its image is shortest in the word metric on (r, To), see |19j and Theorem 2.1 below. In 
particular ir preserves length, that is, the image under ir of a sequence of n symbols in A 
is an element g £ V of shortest length n relative to the generators Tq. Thus to sum over 
g £ r for which \g\ = n as required by Theorems |A| and |B| we need only sum over all 
allowable finite words of length n in the alphabet A. 

It follows that in order to apply Proposition |1.1[ we need only check whether the 
transition matrix P is irreducible and strictly irreducible and that n is, in a precise sense, 
almost bijective to T. Our main work is to show that (subject to some restrictions if 



\dlZ\ < 5) this is indeed the case, see Propositions 2.8, 2.11 and 2.13 

Notice that the statements of Theorems [A] and [B] only involve enumerating words in 
(r, To) and are independent of the precise geometry of 7Z. Thus for example one can replace 
the regular 4g-gon with any hyperbolic octagon whose interior angles sum to 2tt and with 
generators given by the same combinatorial pattern of side pairings. We elaborate on 



this observation in Section 2.1.2, where we explain why requiring a generating set which 
comes from a fundamental domain with even corners is much more general that it appears, 
leading to the general statement in Theorem [B] 

2.1. The Coding. We briefly review the coding as explained in [19] . in which it appears 
in a simpler and more general form than in the original version [1] . 

By abuse of notation from now on we think of the tessellation T as the union of its 
sides, precisely T = U{g(dlZ) : g £ T}. We assume throughout our discussion that 1Z 
has even corners, so that T is a union of complete geodesies in D. Let V C dO be the 
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collection of endpoints of those geodesies in T which meet dlZ (crucially this includes lines 
which meet dlZ only in a vertex of 1Z). The points of V partition dD — V into connected 
open intervals /; we denote the collection of all these intervals by X. 

Let s = s(e) be the side of 1Z whose exterior label is e. The extension of s into a 
complete geodesic lies in T, separating D into two half planes, one of which contains the 
interior of 1Z and one of which contains the interior of eJZ, see Figure[T] Let L(e) denote the 
open arc on dO bounding the component which contains elZ, see Figure [3} Each interval 



I G X is contained in L(e) for at least one and at most two sides of dlZ (see Lemma 2.4 



below for a full justification of this fact). For each I G X, choose e = e(J) G Tq such that 
I C L(e). We define a map /: <9D - V -)■ <9D by f{x) = e{I)~ l {x) for x G I. Extend 
/ to a (discontinuous) possibly two valued map on V in the obvious way. As observed 
in [U H9], the map / is Markov in the sense that for any J G X, f(I) (1 implies that 

/(J) D J. To see this, it is clearly sufficient to show that f(V) C V, independently of 
which of the possibly two choices we make for /. So suppose £ G V is an endpoint of an 
interval I C L(e) and that /(£) = e(/) -1 (£). Write e = e(I). From the definitions, £ is 
an endpoint of a geodesic i which meets the closure of the side s = s(e) of 1Z. From the 
definition of the labelling, e _1 (s(e)) = s(e~ 1 ) is also a side of 1Z. Hence /(£) = e _1 (£) is 
an endpoint of e _1 (t) and e~ l {t) must meet the closure of s(e _1 ), hence /(£) G "P. 

2.1.1. T/ie alphabet map. We define our alphabet by setting „4 = X, in other words, .4 
is the collection of all the intervals defined by the subdivision of dD by points in V . We 
define a transition matrix P = (pij) by pj t j = 1 if /(/) D J and = otherwise. Let 
Si? denote the set of finite sequences h ■ ■ ■ h n with I.; r G X such that pi i ^ > for all 
r = 0, . . . ,n — 1. Thus consists of all allowable finite sequnces in the subshift on the 
symbols I G X with transition rule specified by P. 

The alphabet map tt: X — > Tq is the map which associates to each I G X the element 
e G To corresponding to our choice of e for which / C L(e), equivalently for which f\j = 
e~ l . This extends in an obvious way to a map it : T,p — > T. Recall that a product of n 
elements of Tq is shortest (with respect to the generators Tq ) if it cannot be expressed as a 
product of less than n elements of To- An important feature of the coding is the following 
result which follows from Theorem 5.10 and Corollary 5.11 in [19], see also Theorem 2.8 

in m- 

Theorem 2.1. Suppose that 1Z has even corners and that either 

(i) \d1Z\ > 5 or 

(ii) \dTZ\ = 4 and, if in addition all vertices of 1Z lie in D, then at least three geodesies 
in T meet at each vertex or 

(Hi) \&JZ\ = 3 and at least one vertex of 1Z is on dD. 
Then the alphabet map tt is surjective to T. Moreover every word in tt(T,f) is shortest with 
respect to the geometric generators Tq associated to 1Z as above, and each element g G T 
has a unique representation in 7r(Sjr). 

In what follows, we always assume that 1Z satisfies one of the hypotheses of The- 
orem 2.1 Uniqueness means that any g G T has a unique representation as a word 
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ej x . . . ei n in the image of ir; however we may have two distinct sequences Ij 1 . . . Ij n and 
Ij x . . . Ij n with vr(/j r ) = vr(/j r ) for all r. A key point in the proof of Theorem [B| is to show 
that 7r is nevertheless almost injective, precisely: 

Proposition 2.2. Let g G T. T/ien vr -1 ^) < 2 and #{g: \g\ = n,^ -1 ^)! > l}/n ->■ as 
n —7- oo. 

2.1.2. Ubiquity of even corners. The condition that 7Z have even corners may seem very 
special. However our result depends only on the combinatorics of the generating set, so 
that regions which do not have even corners may still have side pairings which satisfy the 
required conditions. More precisely, we note the following facts: 

(1) Many standard fundamental domains, for example the symmetrical 4g-gon for a 
surface of genus g, and the standard fundamental domain for SX(2,Z), do have 
even corners. Moreover the condition depends only on the geometry of 1Z and not 
on the pattern of side pairings, so that for example the symmetrical 4g-gon with 
opposite sides paired would work equally well. 

(2) A simple observation going back to Koebe shows that the group corresponding to 
any closed hyperbolic surface has a fundamental domain with even corners. To 
see this we have only to choose smooth closed geodesies for the sides of 1Z. This is 
always possible; see [2! for a picture. 

(3) If T has no torsion but contains parabolics then one can always choose a funda- 
mental polygon with all vertices on <9D. Such a polygon certainly has even corners 
(and in fact T is then a free group). 

(4) We showed in J3] that every finitely generated Fuchsian group is quasiconformally 
equivalent to one which has a fundamental domain with even corners. The de- 
formation can be chosen to preserve the combinatorial pattern of sides and side 
pairings of 1Z. Since our results only depend on the group and not on the specific 
hyperbolic structure, it is sufficient to work with the deformed group for which the 
fundamental domain does have the even corner property. 

(5) We show in [16j that, subject to hypotheses essentially the same as those of The- 
orem 2.1 one can always find a partition of dD and a map / whose combinatorial 
properties are identical to those which pertain when 1Z has even corners. One 
could work directly in this setting, but with the disadvantage that without the 
geometry of 1Z at ones disposal it would be much harder to follow what is going 
on. 

(6) Despite the above comments, one should be clear that our results depend heavily 
on the choice of 1Z and the geometrical generating set To- 

Remark 2.3. Let £ denote the space of all infinite sequences Ii Ii 1 ■ ■ ■ allowed by the 
transition matrix P. Let A(T) denote the limit set of T. We showed in |16j that, modulo 



the exceptional cases excluded by Theorem 2.1 , the obvious map defined by "/-expansions" 



induces a surjection vr(S) -4 A(T) which is injective except on a countable number of points 
where it is two-to-one. (The exceptional points are essentially the endpoints of infinite 



special chains, see [TpJ Proposition 4.6 and Section 2.4 below. 
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If r contains no parabolics, then we show in |16j that Hausdorff measure in dimension 
5, where 5 is the exponent of convergence of T, lifts to a Gibbs measure on S. In this case, 
Hausdorff measure is the so-called Patterson measure on A(T). In particular, if B/T is 
compact then Lebesgue measure on dO is Gibbs. In |17j we studied random walks on the 
Cayley graph of T. We showed that if the transition probabilities are finitely supported 
on r, then hitting distribution on A(r) is Gibbs. Note however that the obvious actions 
of T on these spaces are not measure preserving, so our ergodic theorem does not apply 
in this context. 

2.2. Irreducibility. In this section we show that the transition matrix P associated to the 
map / is irreducible, which unfortunately means delving in some detail into its mechanics. 
The starting point is the following simple but crucial result which is Lemma 2.2 in [JJ. 

Lemma 2.4. Suppose that \dTZ\ > 3. Then any two distinct complete geodesies in T 
which meet dTZ are either disjoint in D U <9D, or intersect in a vertex ofTZ. 

The most interesting case is when lines t, t' C T meet dTZ in the two vertices at the 
opposite ends of some side s, neither being coincident with s. The lemma asserts that t and 
t' are disjoint. Note also that there is a choice in the definition of / only if L(e)nL(d) ^ 0. 



It follows from Lemma 2.4 that this occurs only if s(e) and s(d) are adjacent sides of TZ. 




C(v) 



Figure 2. Intervals round a vertex v of TZ. In the case shown, n(v) = 4, 
numbers indicating the levels. All intervals except those of level and 1 
are contained in X. The crown C(v) is the union of all intervals of levels 
2,...,n(v). 

Now we establish some notation. Let v be a vertex of TZ. Let n(v) be the number 
of geodesies of T which meet at v (so that 2n(v ) copies of TZ meet at v). The endpoints 
on effi) of the n{v ) complete geodesies in T which meet at v partition their complement 
in <9D into 2n(v) open intervals, each of which is (the interior of the closure of) a union 
of intervals J £ I. We assign to each of these new intervals a level, denoted lev(.), as 
follows. The interval L{d) n L{e) bounded by the endpoints of the extensions of sides 
s(d) and s(e) of TZ which meet at v has level n{v). The interval opposite L(d) n L(e) has 
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level 0. The remaining intervals going round in both directions from to n(v) have in 
order levels 1,2,..., n(v) — 1, see Figure [2] It follows from Lemma 2.4 that the intervals 
of all levels except and 1 are also intervals in the set I, and moreover that there is a 
choice in the definition of / only on intervals of level n(v). Finally define the crown of 
v, denoted C(v), to be the interior of the closure of the union of the intervals of levels 
n(v),n{y) — 1, ... , n(v) — 2 at v, see Figure [2] Thus C(v) is an open interval on dO. Notice 
that if v G dB then C(v) = 0. 




Figure 3. Intervals subtended by a side of TZ. If the vertices of s(e) are 
v, w then A(e) is the subset of L(e) which is contained in neither C{v) nor 
C(w), while M(e) consists of all of L(e) except the 'ambiguous' top level 
intervals at either end of L(e). Thus f\M(e) = e_1 while f\L(e)r\L(d) can have 
value either e" 1 or dT 1 . 

For each e G Tq define de to be the two vertices of TZ at the ends of the side s(e), 
and let C(e) = C(v) U C(w) where de = {v,w}. Also let M(e) = L(e) - U d jt e L(d) and 



A(e) = L(e) - C(de), see Figure [3j If \&R\ > 3 then Lemma \2A\ implies that .4(e) 7^ 0. 
Note that if x € M(e) then f(x) = e~ l (x). 

We are finally ready to start our proof that the transition matrix P is irreducible. 
In what follows we shall say that a constant depends only on TZ, when we mean that it 
depends on TZ and the combinatorial pattern of side pairings of TZ. We introduce various 
such constants and denote all of them by K. 

Lemma 2.5. Suppose \dTZ\ > 3. Then there exists K £ N, depending only on TZ, such 
that for any I € I, we have f r (I) D A(e) for some r with < r < K and some e G Tq. 

Proof. If / is not already of the form A(e) for some e G To, then it is in the crown of some 
vertex v of TZ, and hence lev(I) = r > 1. Suppose that I C L(e) and that f\j = e _1 . 
Then / carries s(e) to the side e~ 1 (s) = s(e~ 1 ), so that f(v) = e~ l v is a vertex of ^(e^ 1 ). 
Following the discussion at the start of Section [2j the cyclic order of labels around the 
vertices v and f(v) is the same, and by inspection one sees that f{I) is an interval of level 
r — 1 at e~ 1 v. Since the sets A(e) are contained in the union of level 1 sets associated to 
all vertices of TZ, the result follows. □ 



Lemma 2.6. Let e G Tq. Then 
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(i) f(M(e)) D C(v) for any vertex v ^ de 1 . 

(ii) /(M(e)) D A(d) for any d ^ e _1 . 

Proof. By definition, ,/W( e ) = e _1 so that / carries s(e) to ^(e" 1 ). Moreover e -1 carries 
L(e) to the complement in dO of -L(e _1 ). We need to find the image under e _1 of M(e) C 
L(e). Let V,W denote the endpoints of L(e _1 ) on <9D and let V±, W\ denote the points in 
V adjacent to V,W and outside L(e~ 1 ). Then e _1 (M(e)) is the interval on efflJ bounded 
by Vi,Wi and not containing L(e _1 ). This covers all of efflJ except for A(e~ l ) and parts 
ofCide- 1 ). □ 

Lemma 2.7. Suppose that \d1Z\ > 3. T/ien i/iere exists K G N, depending only on 1Z, 
such that for any e G To, and any I £ 1 which is contained in M(e), we /jawe / r (4(e)) D 7 
/or some r < K. 

Proof. By definition, /u(e) = e_1 > an d e_1 maps e7£ (which is the copy of 7£ adjacent to 
7£ along s = s(e)) to 7?.. The endpoints of A(e) are the endpoints on dD of the extensions 
of the sides of elZ adjacent to s, see Figure [3} Thus the endpoints of /(-4(e)) are the 
endpoints on of the extensions of the sides of 1Z adjacent to e~ 1 (s) = s(e _1 ). If these 
sides are s(d),s(d'), then provided that e ^ e -1 , we see that /(4(e)) is the complement 
in <9D of L(e _1 ) U L(d) U L(d'). This gives the result (with r = 1) in the case in which 
s(e) is neither equal to nor adjacent to s(e _1 ). Since in both of the exceptionl cases e is 
necessarily elliptic, this in particular proves the result whenever T is torsion free. 

Now suppose that s(e) and s(e _1 ) are adjacent with common vertex v G D, so that 
e is elliptic with fixed point v. Reasoning as above, we see that /(4(e)) covers M(x) for 
any side s(x) neither equal nor adjacent to s(e _1 ). Fix one such x, which is possible since 
\dll\ > 3. Since by our assumption s(e) and s(e~ 1 ) are adjacent, we have x / e,e _1 . 



By Lemma 2.6 (i), f(M(x)) covers all crowns except C(w) for w £ dx . In particular, 
f(M(x)) D C(v). Let I el be the level 2 interval contained in C(v) n L(e). Then /(J) 
is the level 1 interval in C(v) Pi L(e), which is equal to L{e) — C(v ). Thus suitable powers 
of / applied to C(v) cover all of L(e) D M(e) which gives the result. 

Now consider the case e = e _1 . First assume that dR has at least 5 sides. (Remember 
we count the edge containing the fixed point of e as one side.) In this case, there exist 
x,y, distinct from each other and from e, such that /(4(e)) covers M(x) and M(y). 
Now f(M(x)) covers 4(e) since e ^ x _1 . In addition, f(M(x)) and f(M(y)) together 
cover all crowns except for those crowns C(w) with w E dx~ l n dy~ l . This implies that 
f(M(x)) U f(M(y)) covers C(v) for v G <9e, which gives the result. 

Finally, suppose that 5i? has 4 sides. In this case, /(4(e)) only covers M{x) for x the 
side opposite e. As usual, f(M(x)) covers 4(e). If x = x~ l then f(M(x)) covers M(e). 
Otherwise, x~ l is adjacent to e and f{M(x)) covers M(y) where y is the fourth side of 
dR (opposite x^ 1 ). In this case we have y = y~ l . Letting v,w be the vertices of s(e) 
adjacent to s(x~ 1 ) and s(y) respectively, we see that f(M(y)) covers C(v) and f(M(x)) 
covers C(w). The result follows. □ 



10 



ALEXANDER I. BUFETOV AND CAROLINE SERIES 



Proposition 2.8. The Markov chain P is irreducible. 

Proof. We have to show that there exists K, depending only on 1Z, such that for any 
/, J E I, we have f r (I) D J for some < r < K. 

Assume first that \dR\ > 3. By Lemma 2.5 we may as well assume that I = A{e) 
for some e £ Tq. By Lemma 2.7 it will be enough to show that images of M(e) cover 
OH. By Lemma 2.6, f(M(e)) covers all crowns except C(de~ 1 ) and all sets A(x) with 
x ^ e . Since \dlZ\ > 3 we may choose x,y distinct from each other and from e and 



e _1 such that /(M(e)) D A(x) U A(y). By Lemmas 2.7 and 2.6, there exists r < K such 
that f r (A(x)) covers A(e~ 1 ) and all crowns except C{dx~ l ). Likewise f s (A(y)) covers all 
crowns except C(dy^ 1 ) for some s < K. Now by choice x,y and e are distinct and so 
C(dx- 1 ) n Cidy- 1 ) n Cide" 1 ) = 0. The result follows. 

Finally, we have to consider the case in which \dR\ = 3. Notice that this is the only 
case in which it is possible that A(e) = 0. By hypothesis, at least one vertex of 7Z is on 
dD. There are only three possible cases: 

(i) 7Z has three vertices on dD; 

(ii) 1Z has two vertices v , w on <9D. The side joining v , w is paired to itself by an order 
two elliptic x; the remaining two sides are paired to each other by an elliptic e 
with fixed point at the third (finite) vertex u; 

(iii) 1Z has one vertex v on <9B. The two sides meeting at v are paired to each other by 
e, the third side is paired to itself by an order two elliptic x. 

Case (ii). Set B{e ± ) = L(e ± ) - C{u). Note that /(5(e ± )) = L(x) and f(L(x)) = 
<9B — L(x). Furthermore, for each / C C(u) it is clear that f r (I) = (B(e^)) for some 
r < n(u). This proves the result. 

Case (iii). Let u be the finite endpoint of side e and let J(u) = C(u) — L(e). Define 
J(w) similarly relative to w the finite endpoint of e . Note that f(J(u)) D A{e _1 ) and 
f(J(w)) D A(e). It follows that the image of every interval in C(u) U C(v) eventually 
covers either A(e) or A(e~ 1 ). Further, a bounded image of A(e) covers L(e) U J(u) and a 
bounded image of A{e~ 1 ) covers L(e~ 1 ) U J(iu). The result follows. 

Case (i) is easier and is left to the reader. □ 



2.3. Strict irreducibility. We now investigate strict irreducibility of the transition ma- 
trix P. It is well known and easy to see that P is strictly irreducible if the equivalence 
relation ~ onl generated by / ~ J if /(/) n /( J) ^ has just one equivalence class. We 
show that if \dH\ > 4 then / is always strictly irreducible, while if \&JZ\ < 4 the map / 
may or may not be strictly irreducible depending on the precise arrangement of 1Z and 
its side pairings. In particular, the continued fraction map associated to the standard 
fundamental domain for SL(2, Z) is not strictly irreducible. 

Lemma 2.9. The Markov chain associated to any choice of Markov map f for the fun- 
damental domain \z\ > 1,-1/2 < $tz < 1/2 is not strictly irreducible. 

Proof. The continuations of the sides of dlZ through the two vertices at (1 ± y/3i)/2 meet 
the real axis K in the 7 points —2, —1, —1/2, 0, 1/2, 1, 2 which partition M into 8 intervals 
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which we number 1, 2 . . . , 8 in order from left to right, thus for example 3 denotes the 
interval (—1,-1/2). The map / is defined as: 

f(x) = x + 1 for x G 1 U 2, 

f(x) = -l/x for x £ 3U4U5U6, 

f(x) = x — 1 for x £ 7 U 8. 

There is a choice for / on the overlap regions 4 and 5: for definiteness we have taken the 
usual choice f(x) = —l/x for i £ 4U5 which is associated to the continued fraction map. 

It is easy to write down the transition matrix P for /. We find 1—)- 1 U 2, 2 — )• 3 U 4, 
3 ->■ 7, 4 -> 8, 5 ->■ 1, 6 -> 2, 7 ->• 5 U 6, and 8 ->■ 7 U 8. From this we easily see that 
there are four equivalence classes under ~: {3,4,8}, {5,6,1}, {2} and {7}. This gives 
the result. We remark that even had we made the other choice for / on either of 3 and 6, 
then there are still at least two equivalence classes. □ 

Another interesting example is furnished by the group r = (a, b, c : a 2 = b 2 = (?) 
where 1Z is the ideal triangle with vertices 0, l,oo and a,b,c G P5L(2,Z) are elliptics of 
order two with fixed points at i, (1 + i)/2 and 1 + i respectively. In this case one checks 
that / is strictly irreducible. 

We base our general proof of strict irreducibility on the following lemma. 

Lemma 2.10. Suppose that there exists a family of open intervals Jo, . . . , J m G I such 
that 

(i) U™ /( J i) covers dO-V; 
(n) f(Ji) n /(Ji+i) + fori = 0, . . . ,m - 1. 

Then the Markov chain associated to the Markov map f is strictly irreducible. 

Proof. By assumption (i) , every interval / G T is equivalent to at least one of the Ji . By 
assumption (ii), Jj ~ Jj + i for all < i < m. The result follows. □ 

Proposition 2.11. Suppose that \d1Z\ > 5. Then the Markov chain associated to the 
Markov map f is strictly irreducible. 



Proof. We show the sets A(e),e G To, satisfy the requirements of Lemma 2.10 Suppose 
that the extensions of the sides of 1Z adjacent to e _1 meet <9D in points V and W. Then 
f(A(e)) is the interval between V and W and not containing L(e _1 ). Since \dTZ\ > 5, this 
set of intervals overlaps round <9B in the required manner. □ 

If \d1Z\ = 4, then / may or may not be strictly irreducible. For example, suppose that 
1Z has two opposite vertices v, w G <9D while the other opposite pair are in D. Suppose the 
sides adjacent to v are paired, and equally the sides adjacent to w. Then one can verify 
directly that ~ has two equivalence classes. The idea is that the points v and w divide 
dH) into two halves E and F say. One checks easily that the image of every interval I is 
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contained either completely in E, or completely in F, so the intervals whose images fall 
in these two halves cannot be equivalent. 

On the other hand, if 1Z has 4 sides all of which lie in D, then by hypothesis we assume 
that at least three geodesies in T meet at each vertex. One can check that the images of 



the level one intervals at each vertex cover <9D in the manner required by Lemma 2.10 
We have already studied similar phenomena when \&R\ = 3. 



2.4. The alphabet map. Finally we prove Proposition |2.2| We begin by recalling some 
further terminology from [2[ SI [19] . 

Let ej . . . ei n be a word in the generators Tq. Since Tq consists of side pairing transfor- 
mations of 1Z, the regions 1Z and ei r 1Z, and more generally ej . . . ei r _ 1 TZ and ej . . . e^TZ for 
< r < n, have a common side. The word ej . . . ei n is called a cycle if in the tesselation 
of D by images of 1Z, the regions 1Z, ei 1Z, e^e^TZ, . . . , ej . . . ei n TZ are arranged in order 
round a common vertex w£D, see Section [2j (According to this definition, a single letter 
e is always a cycle provided that at least one of the vertices de is in D.) Cycles ej . . . ej a , 
ej . . . ej t are called consecutive if there exists e £ Tq such that ej . . . ei a e and e~ 1 ej . . . e Jt 
are both cycles, see [2] for more details. This means that ej . . . e« s is a cycle at u and that 
e JO . . . ej t is a cycle with the same orientation at w, where v and w are the endpoints of 
the side e _1 of 1Z, see Figure |4j 




Figure 4. Consecutive cycles. 

The word ej . . . is called a special chain if it consists of a sequence of consecutive 
cycles B1B2 ■ ■ ■ B n at vertices vi,...,v n such that l?i has length at most n{v\) — 1, B n has 
length at most n[v n ) and -Bj has length exactly n(vi) — 1 for 1 < i < n. The geometrical 
meaning of this definition is that the sequences of copies of 1Z corresponding to a special 
chain all touch a common hyperbolic line t C T, all except possibly the first or last one 
lying on the same side of t. 

Remark 2.12. Special chains are intimately connected to the solution of the word problem 
for Fuchsian groups given in [16^ [T8[ [2] . Special chains are shortest words and two shortest 
words V = e{ . . . ei N and W = ej . . . ej N with e« r 7^ e Jr for all r represent the same 
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element of V only if either both V and W are single cycles of length n(v), or are both 
sequences of consecutive cycles of lengths n(v{) — 1, n{v2) — 1, ... , n(v n -\) — 1, n(v n ) and 
n(vi), n{v2) — 1, . . . ,n(v n -i) — l,n(v n ) — 1 respectively. In the latter case, the sequences 
of copies of corresponding to the words ei . . . ei N and ej . . . ej N meet along a common 
line in T ■ 

Proposition 2.13. Let it : X — > T be the alphabet map. Then 7r _1 (ei . . . ei N ) < 2 with 
equality if and only if ej . . . ei N ends in a special chain. 



Proof of Proposition 2.2. This is an immediate corollary of Proposition 2.13 Notice that a 
special chain is completely specified by its initial two letters (which determine the direction 
of the cycle at the first vertex) and the length of the initial cycle. It follows that there 
exits a constant K depending only on 1Z such that the total number of special chains of 
length exactly n is bounded by K, independent of n. Since the total number of words in 
r of length n grows exponentially with n, and since a special chain can be continued to 
arbitrary length, the result follows. □ 



We establish several lemmas before proving Proposition 2.13 



Lemma 2.14. Suppose that vr(/j ... Ij n ) = ir(Ij . . . Ij m )- Thenn = m andn(Ii r ) = 7r(Ij r ) 
for r = 0, . . . , n. Moreover if Ii k ^ Ij k for some k then I{ r ^ Ij r for any r > k. 

Proof. Suppose that 7r(Ij .../,„) = ir(Ij . . . Ij m )- Since the images of both sequences are 
shortest, n = m. Moreover because of unique representation in V by sequences in the 



image of tt, see Theorem 2.1 we have vr(/i r ) = 7r(Ij r ) for r = 0, . . . , n. 



Now suppose that Ii k ^ Ij k . By definition, f\i = tt(I) 1 . Since vr(/j fc ) = Tr(Ij k ), we 
see that / is injective on Ii k U Ij k and the result follows. □ 

Lemma 2.15. Suppose that 7r(Ij . . . Ij n ) = vr(Ij . . . Ij n ) with Ii ^ Ij , and suppose that 
< r < n. Then: 

(i) if Ii r C C(v) n L(e) for e £ To and v £ de, then either Ij r C C(v) or Ij r = A(e); 

(ii) if Ii r = A(e) for e £ Tq then Ij r is adjacent to A(e); 

(Hi) if Ii r ,Ij r C C[v) n L(e) for e £ Tq and some vertex v S de, and if lev(Ij r ) < 
lev(Ii r ) = k and r + k < n, then f k ~ 1 (Ii r ) = Ii r+k _ 1 = A(d) for some d G Tq. 

Proof. Assertion (i): Let w be the other vertex in de. If the result is false, then Ij r C 
C(w)nL(e). Let s(x) and s(y) be the sides of 1Z adjacent to s(e _1 ). Note that /|/. = e _1 
so that f{C(v) n L{e)) C L(x), say, and f(C(w) n L(e)) C L(y). 



By the hypotheses of Theorem 2.1 either \dlZ\ > 3 or 1Z has a vertex at oo. In 



both cases L(x) and L(y) are disjoint. In the first case this is clear by Lemma 2.4 
For the second, observe that we may assume that both vertices in 9e _1 are in D, since 
otherwise at least one of C(y) and C(w) is empty and there is nothing to prove. This 
forces vr(/j r+1 ) / ir(Ij r+1 ) contrary to hypothesis, which gives the result. 
Assertion (ii): Suppose first that e ^ e^ 1 . Observe that the image under e _1 of any 
interval in I C L(e) but not adjacent to A(e) is contained in C(de~ l ) and is thus contained 
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in L(e _1 ) UL(x) UL(y) where as above s(x), s(y) are the two sides of 1Z adjacent to s(e _1 ). 
On the other hand, the image under e" 1 of A(e) is outside L(e _1 ) U L(x) U L{y). Thus 
provided L(x) and L(y) are disjoint, ir(J) ^ {e,x,y} for any J £ X contained in f(A(e)). 
The result follows as above if \dlZ\ > 3. The special case |<97*!.| = 3 is easily treated 
separately. 

Finally suppose that e = e _1 . If / C C{de) then /(/) C C(de) but f{A{e)) is outside 
L(e) which is impossible. 

Assertion (Hi): The map / decreases level and at each stage with t < k, h r+t and Ij r+t 
are in the crown of a common vertex. At step k — 1, Ii r+k _ 1 has level 1 so that by (i), 
^v +fc _i = A(d) for some d £ Tq. □ 



Proof of Proposition 2.13[ Suppose that 

n{ J io •••4J = n(Ij ■■■ I j n ) 



Without loss of generality we may as well assume that Ii ^ Ij . By Lemma 2.14 Ii r ^ Ij r 
for any r > 0. 

Suppose first that r < n and that Ij r C C{v ) n L(e) has level k > 1 in the clockwise 
direction starting at the highest level interval at vertex v. (The proof if the interval is in the 
anticlockwise direction is similar; obviously the direction of all cycles are then reversed.) 
Then h r+1 has level k — 1 at vertex e~ x v and is outside L(e _1 ). Thus f{Ii r ) C L(e') 
where s(e') is the side of 1Z adjacent to s(e~ 1 ) in clockwise order round dlZ. Therefore 
e _1 e' _1 = 7r(/j r )7r(Ij r+1 ) is an anticlockwise cycle at e~ 1 v. 

Inductively, it follows that vr(/j r )7r(/j r+1 ) . . ■7r(Ii r+k _ 1 ) is an anticlockwise cycle. Fur- 
ther, by Lemma 2.15 (iii) we see that h r+k _ 1 = A{d) for some d € Tq and that i Jr+fc „ 1 is 



adjacent to h r+k _ 11 where d 1 = 7r(I ir+k _ 1 ). 

Let the sides of dlZ in clockwise order from s(d) be s(d), s(c), s(b) (so that the interior 
labels of sides in clockwise order are c _1 , b^ 1 ). Let v be the common vertex of s(d) 



and s(c) and let w be the common vertex of s(c) and s(b). From the proof of Lemma 2.15 
(ii), it follows that Ij r+k is necessarily the highest level interval at v , and that ir(Ij r+k ) = c. 

Now 7r(ij r+fc _ 1 )7r(/j r+fc ) = d~ 1 c is an anticlockwise cycle at v, hence so also is 
7r(Ii r )Tr(Ii r+1 ) . . . 7r(Ji r+fc _ 1 )c. Furthermore c~ 1 ir(Ij r+k ) = c~ l b is an anticlockwise cycle at 
w. This means the cycles 

^{hMhr+i) ■ ■ ■ ^(lir+t-i) and T(I jr+k ) = vr(J ir+ J 
are consecutive. Moreover , , and , , are both contained in the crown at w, in fact 

i r+k Jr + k > 

h r+k is the level n(w) — 1 interval in the crown at w, adjacent to Ij r+k going around 
clockwise. 

Now the first part of the argument repeats so that ir(Ij r+k ) is (assuming n is sufficiently 
large) the first term in an anticlockwise cycle of length of length n(w) — 1. A similar 
argument in the case Ii = A(e) completes the proof. □ 



2.5. An Ergodic Theorem for Fuchsian Groups. As before, let T be a finitely gen- 
erated non-elementary Fuchsian group acting in the hyperbolic disk D, and assume that 
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a fundamental domain 1Z for T has even corners and satisfies \&R\ > 5. As before, let Tq 
be the generating set corresponding to 1Z. For g £ T, let \g\ be the length of the shortest 
word in Tq representing T, and for n £ N, let 

S(n) = {g G T : \g\ = n} 

be the sphere of radius n in T. Finally, let K n be the cardinality of S(n). Observe that 
K n grows exponentially and so, by the Borel-Cantelli Lemma, the contribution of "non- 



injective" elements of Proposition 2.2 to the spherical averages is negligible. Propositions 



LllpO and 2.11 now imply 



Theorem B. Let T act ergodically on a probability space (X, u) by measure-preserving 
transformations, and, for g 6 T, let T g be the corresponding transformation. Suppose Fq 
is a geometric set of generators associated to a Markov partition as in Section \^ and 
suppose either that \dlZ\ > 5, or, if \dlZ\ < 5, that the associated transition matrix is 
strictly irreducible. Then, measuring word length with respect to the generators Tq, for 
any if £ L\{X, v) we have 

N-l 

n=0 n geS(n) £ 

both v -almost surely and in Li(X, v) as N — > oo. 
Theorem [B] implies special case Theorem \K\ 
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